Heritability Estimation using a Regularized Regression Approach (HERRA): Applicable to continuous, dichotomous or age-at-onset outcome
Abstract
The popular Genome-wide Complex Trait Analysis (GCTA) software uses random-effects models to estimate narrow-sense heritability from GWAS data on unrelated individuals, without knowing or identifying the causal loci. Many methods have since extended this approach to various situations. However, because the proportion of causal loci among the variants is typically very small and GCTA uses all variants to calculate the similarities among individuals, the heritability estimates may be unstable and have a large variance. Moreover, if the causal SNPs are not genotyped, GCTA sometimes greatly underestimates the true heritability. We present a novel narrow-sense heritability estimator, HERRA, built on well-developed ultra-high-dimensional machine-learning methods. Like existing methods, it is applicable to continuous or dichotomous outcomes. Additionally, HERRA is applicable to time-to-event or age-at-onset outcomes, which, to our knowledge, no existing method can handle. Compared with GCTA and LDAK for continuous and binary outcomes, HERRA often has a smaller variance, and when causal SNPs are not genotyped, HERRA has a much smaller empirical bias. We applied GCTA, LDAK and HERRA to a large colorectal cancer dataset with a dichotomous outcome (4,312 cases and 4,356 controls, genotyped using Illumina 300K); the respective heritability estimates were 0.068 (SE = 0.017), 0.072 (SE = 0.021) and 0.110 (SE = 5.19 × 10⁻³). HERRA thus yields an increase of over 50% in the heritability estimate compared with GCTA or LDAK.
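The "over 50%" comparison can be checked directly from the three point estimates quoted in the abstract. The following is only an arithmetic sketch, not part of HERRA itself; the helper name `relative_increase` is hypothetical and introduced here for illustration.

```python
def relative_increase(new: float, baseline: float) -> float:
    """Return the fractional increase of `new` over `baseline`."""
    return (new - baseline) / baseline


# Point estimates of heritability as reported in the abstract.
estimates = {"GCTA": 0.068, "LDAK": 0.072, "HERRA": 0.110}

# Relative increase of the HERRA estimate over each of the other two methods.
increases = {
    method: relative_increase(estimates["HERRA"], value)
    for method, value in estimates.items()
    if method != "HERRA"
}

for method, inc in increases.items():
    print(f"HERRA vs {method}: {inc:.1%} higher")
```

Under these figures the HERRA estimate is roughly 62% higher than the GCTA estimate and roughly 53% higher than the LDAK estimate, which is consistent with the stated increase of over 50%.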
Similar resources
Heritability Estimation using Regularized Regression Approach (HERRA)
Heritability is a concept that summarizes the proportion of phenotypic variance that is due to genetic factors, with broad-sense heritability referring to genetic variation that may include effects due to dominance and epistasis, and narrow-sense heritability referring to additive genetic variation. The popular GCTA software uses the random effects approach to estimate the narrow-sense heritabi...
Estimation of Variance Components for Body Weight of Moghani Sheep Using B-Spline Random Regression Models
The aim of the present study was to estimate (co)variance components and genetic parameters for body weight of Moghani sheep, using random regression models based on B-spline functions. The data set included 9165 body weight records from 60 to 360 days of age from 2811 Moghani sheep, collected between 1994 and 2013 at the Jafar-Abad Animal Research and Breeding Institute, Ardabil province,...
Evaluation of the effective factors on Bipolar I Disorder frequent recurrence in a 5 years longitudinal study using generalized estimation equations method
Background and Purpose: Patients with Bipolar I Disorder experience recurrent mood swings between mania and depression over time. Hence, longitudinal studies of patients with Bipolar Disorder are needed. This study aims to evaluate the factors affecting frequent recurrence of Bipolar I Disorder in a 5-year longitudinal study using the generalized estimation equations (GEE) m...
STUDY OF THE ASSOCIATION BETWEEN ACTIVITY LEVEL AT ONSET OF SYMPTOMS AND PATIENT OUTCOME OF FIRST ACUTE MYOCARDIAL INFARCTION
This study sought to compare the clinical features and outcome of a first acute myocardial infarction (AMI) with onset of symptoms during or within 30 minutes of exercise, at rest and in bed. Information collected using a standard questionnaire was used to relate activity at the onset of symptoms and in-hospital outcome in 500 consecutive patients admitted to our heart center with a first ...
Random regression models for estimation of covariance functions of growth in Iranian Kurdi sheep
Body weight (BW) records (n=11,659) of 4961 Kurdi sheep from 215 sires and 2085 dams were used to estimate the additive genetic, direct and maternal permanent environmental effects on growth from 1 to 300 days of age. The data were collected from 1993 to 2015 at a breeding station in North Khorasan province, Iran. Genetic parameters for growth traits were estimated using random regression test-...